An Almost Sure Representation for Intermediate
Order Statistics: The Finite Endpoint Case


by Vernon Watts*


1. Introduction.


Let $\{X_n\}_{n \ge 1}$ be a sequence of independent and identically distributed
random variables on $(\Omega, \mathcal{F}, P)$ with marginal distribution function
$F(x) = P(X_1 \le x)$. Let $\{k_n\}_{n \ge 1}$ be a rank sequence of integers, that is,
$1 \le k_n \le n$ for each $n$, and denote by $X_{k_n}^{(n)}$ the $k_n$-th smallest order
statistic of the sample $X_1, \ldots, X_n$. Suppose that $F(x_\lambda) = \lambda$ for a
given $\lambda$ in the open unit interval $(0, 1)$, and in a neighborhood
$(x_\lambda - \delta, x_\lambda + \delta)$ the second derivative $F''$ exists and is
bounded, and assume that $F'(x_\lambda) > 0$. Let
$F_n(x, \omega) = n^{-1} \sum_{i=1}^{n} I_{[X_i \le x]}$ be the empirical distribution
function of $X_1, \ldots, X_n$. Then for the sequence of central order statistics
$\{X_{k_n}^{(n)}\}_{n \ge 1}$, where $k_n = n\lambda + o(n^{1/2} \log n)$, Bahadur (1966)
showed that


$$X_{k_n}^{(n)}(\omega) = x_\lambda + \frac{k_n - n F_n(x_\lambda, \omega)}{n F'(x_\lambda)} + R_n(\omega), \tag{1.1}$$


where $R_n(\omega) = O(n^{-3/4} \log n)$ as $n \to \infty$ with probability one. The
precise order of the remainder term $R_n$ was found by Kiefer (1967). From the
representation (1.1) may be obtained important limiting properties of central order
statistics, including asymptotic normality and the law of the iterated logarithm.
This representation has been extended to central order statistics from dependent
sequences $\{X_n\}$ by Sen (1968, 1972), and under fewer assumptions upon the
marginal d.f. $F$ a weaker convergence of $R_n$ was obtained by Ghosh (1971).

In this paper we establish an analogous representation for intermediate
order statistics $\{X_{k_n}^{(n)}\}$, that is, when the rank sequence $\{k_n\}$
satisfies $k_n \to \infty$ but $k_n/n \to 0$. ($\{k_n\}$ is then called an intermediate
rank sequence.) For simplicity we deal only with the case of i.i.d. sequences
$\{X_n\}$, although extensions to dependent or non-identically distributed sequences
are possible. Our assumptions upon the marginal d.f. $F$ parallel those of Bahadur
and involve the behavior of $F$ near its left endpoint $x_0$, defined by
$x_0 = \inf\{x : F(x) > 0\}$, which here we assume to be finite. Additionally, we
obtain from our representation the asymptotic normality property and a law of the
iterated logarithm for intermediate order statistics.


2. Statement and proof of the main result.

Let $F$ be a d.f. with a finite left endpoint $x_0$ such that $F(x_0) = 0$ (that
is, $F$ has no probability mass at $x_0$). Suppose that inside an interval
$(x_0, x_0 + \delta)$ $F$ is twice differentiable with $F''$ bounded, and that
$\lim_{x \downarrow x_0} F'(x)$ exists and is positive. Let $\{k_n\}$ be an
intermediate rank sequence; we impose the mild restriction that
$k_n / \log n \to \infty$. We then determine $x_n$ by the relation
$F(x_n) = k_n/n$; the value $x_n$ is well-defined and unique for all sufficiently
large $n$, by the assumptions upon $F$.
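
As a minimal sketch of how $x_n$ is determined by $F(x_n) = k_n/n$, the following solves the relation by bisection for one illustrative d.f. that satisfies the left endpoint conditions: the standard exponential $F(x) = 1 - e^{-x}$, with $x_0 = 0$. The choice of $F$, the function name `solve_xn`, the bracketing interval, and the rank $k_n = \lfloor n^{1/2} \rfloor$ are assumptions made only for this example.

```python
import math

def F(x):
    """Standard exponential d.f.; an illustrative choice with left endpoint x0 = 0."""
    return 1.0 - math.exp(-x) if x > 0 else 0.0

def solve_xn(n, k, lo=0.0, hi=50.0, tol=1e-14):
    """Solve F(x) = k/n on [lo, hi] by bisection (F is increasing there)."""
    target = k / n
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if F(mid) < target:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

n = 10_000
k = math.isqrt(n)                      # k_n = 100, so k_n is large while k_n / n is small
x_n = solve_xn(n, k)
closed_form = -math.log(1.0 - k / n)   # exact solution for this particular F
```

For this particular $F$ the closed form $x_n = -\log(1 - k_n/n)$ is available, so the bisection is only a stand-in for cases where $F$ cannot be inverted explicitly.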

Again denote the empirical d.f. of $X_1, \ldots, X_n$ by $F_n(x, \omega)$. In this
section we prove the following result.


Theorem 1. Suppose that $\{X_n\}$ is an i.i.d. sequence with marginal d.f. $F$
satisfying the above left endpoint conditions, and let $\{k_n\}$ be an intermediate
rank sequence for which $\log n = o(k_n)$ as $n \to \infty$. Then

$$X_{k_n}^{(n)}(\omega) = x_n + \frac{k_n - n F_n(x_n, \omega)}{n F'(x_n)} + R_n(\omega), \tag{2.1}$$

where

$$R_n(\omega) = O(n^{-1} k_n^{1/4} \log^{3/4} n) \quad \text{as } n \to \infty$$

with probability one.
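
The representation can be explored numerically. The sketch below, under assumptions of our own choosing, draws i.i.d. standard exponential samples (so $F(x) = 1 - e^{-x}$, $x_0 = 0$, $F'(x_n) = 1 - k_n/n$), computes the remainder as the difference between the order statistic and the main term $x_n + (k_n - nF_n(x_n))/(nF'(x_n))$, and compares its size with the rate $n^{-1} k_n^{1/4} \log^{3/4} n$. The function names, the seed, and the values of $n$ and $k_n$ are hypothetical; a finite simulation can illustrate but not verify an almost sure statement.

```python
import math
import random

def remainder(n, k, rng):
    """One realization of R_n for i.i.d. standard exponential X_1, ..., X_n.

    Here F(x) = 1 - exp(-x), x0 = 0, and x_n solves F(x_n) = k/n.
    """
    xs = sorted(rng.expovariate(1.0) for _ in range(n))
    x_n = -math.log(1.0 - k / n)
    f_n = 1.0 - k / n                        # F'(x_n) = exp(-x_n) = 1 - k/n
    order_stat = xs[k - 1]                   # k-th smallest observation
    count = sum(1 for x in xs if x <= x_n)   # n * F_n(x_n)
    main_term = x_n + (k - count) / (n * f_n)
    return order_stat - main_term

def rate(n, k):
    """The rate n^{-1} k^{1/4} (log n)^{3/4} appearing in the theorem."""
    return k ** 0.25 * math.log(n) ** 0.75 / n

rng = random.Random(12345)
n, k = 20_000, 200
ratios = [abs(remainder(n, k, rng)) / rate(n, k) for _ in range(20)]
max_ratio = max(ratios)
```

If the theorem's rate is the right scale, `max_ratio` should remain of moderate size as $n$ and $k_n$ grow along an intermediate rank sequence.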

Throughout we suppose that $\{X_n\}$ and $\{k_n\}$ are given sequences of r.v.'s
and integers, respectively, for which the hypotheses of Theorem 1 are satisfied.
Define a sequence of positive numbers

$$a_n = n^{-1} k_n^{1/2} (1 + \varepsilon)^{1/2} \log^{1/2} n,$$

where $\varepsilon > 0$, and a sequence of integers $\{b_n\}$ satisfying

$$b_n \sim \left( \frac{k_n}{\log n} \right)^{1/4} \quad \text{as } n \to \infty.$$


For sufficiently large $n$ let

$$G_n(x, \omega) = F_n(x, \omega) - F(x) - F_n(x_n, \omega) + \frac{k_n}{n},$$

$$H_n(\omega) = \sup_{x_n - a_n \le x \le x_n + a_n} |G_n(x, \omega)|.$$


The following two lemmas constitute the major portion of the proof of
Theorem 1.


Lemma 2.1. With probability one,

$$H_n(\omega) = O(n^{-1} k_n^{1/4} \log^{3/4} n) \quad \text{as } n \to \infty.$$

Proof. Let $N_1$ be an integer such that $x_n$ is uniquely defined for $n \ge N_1$.
For $n \ge N_1$ and for any integer $r$, let $\eta_{r,n} = x_n + a_n b_n^{-1} r$. By
the monotonicity of $F$ and $F_n$ we have for $x \in [\eta_{r,n}, \eta_{r+1,n}]$,

$$F_n(\eta_{r,n}, \omega) - F(\eta_{r+1,n}) \le F_n(x, \omega) - F(x) \le F_n(\eta_{r+1,n}, \omega) - F(\eta_{r,n}).$$

Hence

$$\begin{aligned}
H_n(\omega) &\le \max_{-b_n \le r \le b_n} |G_n(\eta_{r,n}, \omega)|
  + \max_{-b_n \le r \le b_n - 1} \{F(\eta_{r+1,n}) - F(\eta_{r,n})\} \\
&= K_n(\omega) + \alpha_n,
\end{aligned} \tag{2.2}$$

say. Since $\eta_{r+1,n} - \eta_{r,n} = a_n b_n^{-1}$, $\eta_{b_n,n} = x_n + a_n \to x_0$,
and by the assumptions upon $F$ in the fixed interval $(x_0, x_0 + \delta)$, we have
that

$$\alpha_n = O(n^{-1} k_n^{1/4} \log^{3/4} n). \tag{2.3}$$


We will use the following inequality of Bernstein (see Bahadur (1966)):
Let $B(n, p)$ be a binomial r.v. with parameters $n$ and $p$, for some integer
$n \ge 1$ and for $0 < p < 1$. Then for every $t > 0$,

$$P(|B(n, p) - np| \ge t) \le 2 \exp(-h), \tag{2.4}$$

where $h = h(n, p, t)$ is given by

$$h = \frac{t^2}{2\{np(1 - p) + \frac{t}{3} \max\{p, 1 - p\}\}}.$$
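
To make the inequality (2.4) concrete, the following sketch compares the bound $2\exp(-h)$ with the exact two-sided binomial tail obtained by summing the probability mass function. The function names and the particular values of $n$, $p$ and $t$ are illustrative assumptions; the small value of $p$ mimics the situation in the proof, where the relevant probabilities tend to zero.

```python
import math

def bernstein_bound(n, p, t):
    """Right-hand side 2*exp(-h) of (2.4)."""
    h = t * t / (2.0 * (n * p * (1.0 - p) + (t / 3.0) * max(p, 1.0 - p)))
    return 2.0 * math.exp(-h)

def exact_two_sided_tail(n, p, t):
    """P(|B(n, p) - n p| >= t), summed directly from the binomial pmf."""
    total = 0.0
    for j in range(n + 1):
        if abs(j - n * p) >= t:
            total += math.comb(n, j) * p ** j * (1.0 - p) ** (n - j)
    return total

n, p = 200, 0.05
checks = [(t, exact_two_sided_tail(n, p, t), bernstein_bound(n, p, t))
          for t in (2.0, 5.0, 10.0, 20.0)]
```

Each triple in `checks` holds a deviation $t$, the exact tail probability, and the Bernstein bound, so the two can be compared directly.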

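Let $C_1 > \lim_{x \downarrow x_0} F'(x) > 0$. We may choose $N_2 \ge N_1$ such that
$F(x_n + a_n) - F(x_n) < C_1 a_n$ and $F(x_n) - F(x_n - a_n) < C_1 a_n$ for all
$n \ge N_2$. Now for any $n \ge N_2$ and any integer $r$,

$$G_n(\eta_{r,n}) =
\begin{cases}
n^{-1} \sum_{i=1}^{n} I_{[x_n < X_i \le \eta_{r,n}]} - p_{r,n}, & \text{if } r \ge 0, \\
-n^{-1} \sum_{i=1}^{n} I_{[\eta_{r,n} < X_i \le x_n]} + p_{r,n}, & \text{if } r < 0,
\end{cases}$$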


where $p_{r,n} = |F(\eta_{r,n}) - k_n/n|$. Letting
$t_n = C_2 n^{-1} k_n^{1/4} \log^{3/4} n$ for some $C_2 > 0$, we therefore have by (2.4),

$$P(|G_n(\eta_{r,n})| \ge t_n) \le 2 \exp(-h_{r,n}),$$

where

$$h_{r,n} = \frac{n^2 t_n^2}{2\{n p_{r,n}(1 - p_{r,n}) + \frac{n t_n}{3} \max\{p_{r,n}, 1 - p_{r,n}\}\}}
\ge \frac{n t_n^2}{2 p_{r,n} + t_n}.$$


Since $n \ge N_2$ and $|r| \le b_n$ imply $p_{r,n} < C_1 a_n$, we have

$$P(|G_n(\eta_{r,n})| \ge t_n) \le 2 \exp(-\delta_n)$$

for $n \ge N_2$ and $|r| \le b_n$, where

$$\delta_n = \frac{n t_n^2}{2 C_1 a_n + t_n},$$

which does not depend on $r$. Therefore

$$P(K_n \ge t_n) \le \sum_{r = -b_n}^{b_n} P(|G_n(\eta_{r,n})| \ge t_n) \le 4 b_n \exp(-\delta_n).$$


An easy calculation shows that
$\delta_n \ge \dfrac{C_2^2}{2 C_1 (1 + \varepsilon)^{1/2} + 1} \log n$ for $n \ge N_3$, say,
so that for given $C_1$ and $\varepsilon$, choosing $C_2$ sufficiently large such that
$\dfrac{C_2^2}{2 C_1 (1 + \varepsilon)^{1/2} + 1} > \dfrac{5}{4}$, we obtain

$$\sum_{n} P(K_n \ge t_n) < \infty.$$

Thus by the Borel-Cantelli lemma,

$$P(K_n \ge t_n \text{ infinitely often}) = 0,$$

and since $t_n = C_2 n^{-1} k_n^{1/4} \log^{3/4} n$, the statement of the lemma follows
from (2.2) and (2.3).


Lemma 2.2. With probability one,

$$x_n - a_n \le X_{k_n}^{(n)}(\omega) \le x_n + a_n$$

for all large $n$ (that is, for $n \ge$ some $N(\omega)$). Note that $x_0 < x_n - a_n$
for all large $n$, by the choice of $a_n$, the conditions on $F$, and the restriction
$\log n = o(k_n)$.

Proof. First we have

$$\begin{aligned}
P(X_{k_n}^{(n)} < x_n - a_n)
&\le P\Big( \sum_{i=1}^{n} I_{[X_i \le x_n - a_n]} \ge k_n \Big) \\
&= P\Big( \sum_{i=1}^{n} I_{[X_i \le x_n - a_n]} - n p_n \ge k_n - n p_n \Big) \\
&\le P\Big( \sum_{i=1}^{n} I_{[X_i \le x_n - a_n]} - n p_n \ge C_3 n a_n \Big),
\end{aligned}$$

where $p_n = F(x_n - a_n)$, for some $C_3 > 0$, and for all large $n$, since
$a_n = o(k_n/n)$ and since $F'$ exists and is bounded away from zero in some fixed
interval.

Then by (2.4),

$$P(X_{k_n}^{(n)} < x_n - a_n)
\le 2 \exp\Big( \frac{-C_3^2 n^2 a_n^2}{2 n p_n + C_3 n a_n} \Big)
\le 2 \exp\Big( \frac{-C_4 n^2 a_n^2}{k_n} \Big)$$

for some $C_4 > 0$ (not depending on $\varepsilon$) and for large $n$, since
$p_n < k_n/n$. But $k_n^{-1} n^2 a_n^2 = (1 + \varepsilon) \log n$, so that if
$\varepsilon > 0$ is chosen sufficiently large, we obtain

$$\sum_{n \ge N_1} P(X_{k_n}^{(n)} < x_n - a_n) < \infty. \tag{2.5}$$

In a similar way we have

$$P(X_{k_n}^{(n)} > x_n + a_n)
\le P\Big( \sum_{i=1}^{n} I_{[X_i > x_n + a_n]} - n p_n \ge C_5 n a_n \Big)$$

for some $C_5 > 0$ and for large $n$, where now $p_n = 1 - F(x_n + a_n)$. Then the
relations $F(x_n + a_n) < k_n/n + C_1 a_n$ and $a_n = o(k_n/n)$ along with (2.4)
lead to

$$P(X_{k_n}^{(n)} > x_n + a_n) \le 2 \exp\Big( \frac{-C_6 n^2 a_n^2}{k_n} \Big)$$

for some $C_6 > 0$ and for large $n$, so if $\varepsilon$ is sufficiently large,

$$\sum_{n \ge N_1} P(X_{k_n}^{(n)} > x_n + a_n) < \infty. \tag{2.6}$$

Combining (2.5) and (2.6) and applying the Borel-Cantelli lemma completes the
proof.


Proof of Theorem 1: From Lemmas 2.1 and 2.2 it follows that with probability
one,

$$\Big| F_n(X_{k_n}^{(n)}(\omega), \omega) - F(X_{k_n}^{(n)}(\omega)) - F_n(x_n, \omega) + \frac{k_n}{n} \Big|
= O(n^{-1} k_n^{1/4} \log^{3/4} n).$$

Since $F$ has a bounded second derivative in $(x_0, x_0 + \delta)$ and since
$a_n = o(1)$, Lemma 2.2 gives

$$F(X_{k_n}^{(n)}(\omega)) = \frac{k_n}{n} + (X_{k_n}^{(n)}(\omega) - x_n) F'(x_n) + O(n^{-2} k_n \log n)$$

w.p.1. Then since we may assume (w.p.1) that
$F_n(X_{k_n}^{(n)}(\omega), \omega) = k_n/n$, noting that $F(x_0) = 0$ and from the
assumption that $F'(x_n)$ is bounded away from zero for large $n$, the theorem
follows.


3. Further Limiting Properties


If $\{k_n\}$ satisfies the slightly more severe restriction
$k_n / \log^3 n \to \infty$, then we may verify using the representation (2.1) that
under the assumptions of the last section, the intermediate order statistic
$X_{k_n}^{(n)}$ is asymptotically normal with mean $x_n$ and variance
$k_n / (n F'(x_n))^2$. For this we merely apply the ordinary central limit theorem
for triangular arrays to the array defined by

$$k_n^{-1/2} \Big( I_{[X_i \le x_n]} - \frac{k_n}{n} \Big), \qquad 1 \le i \le n.$$

This result has been previously obtained by Cheng (1965), without assuming the
existence of the second derivative $F''$.
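
The normal approximation can be illustrated with a small Monte Carlo sketch. Under assumptions chosen only for this example, the data are uniform on $(0, 1)$, so that $F'(x) = 1$ and $x_n = k_n/n$, and the standardized quantity $n F'(x_n)(X_{k_n}^{(n)} - x_n) / k_n^{1/2}$ is computed for repeated samples. The function name, the seed, and the values of $n$, $k_n$ and the number of replications are hypothetical.

```python
import math
import random
import statistics

def standardized_stat(n, k, rng):
    """n F'(x_n) (X_k - x_n) / sqrt(k) for uniform(0, 1) data, where F' = 1 and x_n = k/n."""
    xs = sorted(rng.random() for _ in range(n))
    return n * (xs[k - 1] - k / n) / math.sqrt(k)

rng = random.Random(2024)
n, k, reps = 5_000, 100, 400
values = [standardized_stat(n, k, rng) for _ in range(reps)]
sample_mean = statistics.fmean(values)     # should be near 0 if the approximation holds
sample_var = statistics.variance(values)   # should be near 1 if the approximation holds
```

A sample mean near 0 and a sample variance near 1 are what asymptotic normality with mean $x_n$ and variance $k_n / (n F'(x_n))^2$ would suggest; this is an illustration at one sample size, not evidence about the limit.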

Additionally, we may use (2.1) to establish as a new result a law of the
iterated logarithm for $X_{k_n}^{(n)}$. This has been found explicitly by Kiefer
(1972, Theorems 5 and 6) for intermediate order statistics from i.i.d. sequences of
r.v.'s uniformly distributed on the unit interval, under the restrictions that
$k_n \to \infty$ and $k_n/n \to 0$ monotonically and $k_n / \log\log n \to \infty$.
Suppose the d.f. $F$ is everywhere continuous, so that each $F(X_i)$ has the
uniform (0,1) distribution. Then translating Kiefer's Theorem 5 according to the
relation

$$F_n(x_n, \omega) = n^{-1} \sum_{i=1}^{n} I_{[F(X_i) \le k_n/n]}$$

for sufficiently large $n$, we obtain from (2.1) the following result.

Theorem 2. Let $\{X_n\}$ be i.i.d. with continuous marginal d.f. $F$ satisfying the
conditions stated in Section 2. If the intermediate rank sequence $\{k_n\}$
satisfies $k_n \uparrow \infty$ and $k_n/n \downarrow 0$, and

$$\frac{k_n (\log\log n)^2}{\log^3 n} \to \infty,$$

then

$$P\Big( \limsup_{n} \pm \frac{n F'(x_n)(X_{k_n}^{(n)} - x_n)}{(2 k_n \log\log n)^{1/2}} = 1 \Big) = 1.$$


Finally we mention that an asymptotic approximation to the second moment
of the remainder term $R_n$ defined by (2.1) may be derived using a procedure
similar to that of Duttweiler (1973) to estimate the quantity $E R_n^2$ for $R_n$
given in the Bahadur representation (1.1). To do this it is somewhat more
convenient to deal with the slightly different intermediate order statistic
representation

$$X_{k_n}^{(n)}(\omega) = x_n' + \frac{\frac{k_n}{n+1} - F_n(x_n', \omega)}{F'(x_n')} + R_n'(\omega), \tag{3.1}$$

where $x_n'$ satisfies $F(x_n') = \frac{k_n}{n+1}$ (for large $n$). It is easily seen
from the proofs in Section 2 that also
$R_n'(\omega) = O(n^{-1} k_n^{1/4} \log^{3/4} n)$ as $n \to \infty$ with probability one.
Then under the restriction that $k_n / n^{\theta} \to \infty$ for some $\theta > 0$, we
have that for $R_n'$ defined by (3.1),

$$E(R_n')^2 \sim \Big( \frac{2}{\pi} \Big)^{1/2} \frac{k_n^{1/2}}{(n F'(x_n'))^2} \quad \text{as } n \to \infty.$$

The details of the calculation of this approximation are provided by Watts
(1977).